1 lOOSOfi^'/ ,., 

By Expicss Mail # EV052763391US 



JCO/Rae'dPCT/PTQ i Q m , niff 



FORM PTO-1390 U S DFPARTMFN1 01 COMMERCE PA 1 EN 1 AND 1 RADFMARK OFHCE 

(REV 10-94) 

TRANSMITTAL LETTER TO THE UNITED STATES 
DESIGNATED/ELECTED OFFICE (DO/EO/US) CONCERNING A FILING 
UNDER 35 U.S.C. 371 












U.S. APPLICATION NO. 
(If known, sec 37 CFR 1 .5) 


INTERNATIONAL A ITUCA 1 ION NO 

PCT/FI00/00647 


INTERNATIONAL FILING DA I E 

14 July 2000 


- PRIORI! Y DA I E CLAIMED 

14 July 1999 



ITLE OF INVENTION 

Method For Decreasing The Processing Capacity Required By Speech Encoding And A Network Element 



APPLICANT(S) FOR DO/EO/US 

Ari LAKAN1EM1 

Applicant herewith submits to the United States Designated/Elected Office (DO/EO/US) the following items and other 
information: 

1 . [x]This is a FIRST submission of items concerning a filing under 35 U.S.C. 371. 

2. [] This is a SECOND or SUBSEQUENT submission of items concerning a filing under 35 U.S.C. 371 

3. [x]This express request to begin national examination procedures (35 U.S.C. 371(f)) at any time rather than delay 

examination until the expiration of the applicable time limit set in 35 U.S.C. 371(b) and PCT Articles 22 and 
39(1). 

4. [x]A proper Demand for International Preliminary Examination was made by the 19th month from the earliest 

claimed priority date. 

5. [xJA copy of the International Application as filed (35 U.S.C. 371(c)(2)) 

a. [x]is transmitted herewith (required only if not transmitted by the International Bureau). 

b. [x]has been transmitted by the International Bureau. 

c. [] is not required, as the application was filed m the United States Receiving Office (RO/US) 

6. [] A translation of the International Application into English (35 U.S.C. 371(c)(2)). 

7. [x] Amendments to the claims of the International Application under PCT Article 19 (35 U.S.C. 371(c)(3)) 
. a. [ ] are transmitted herewith (required only if not transmitted by the International Bureau). 

* b. [] have been transmitted by the International Bureau. 

c. [] have not been made; however, the time limit for making such amendments has NOT expired. 

d. [x]have not been made and will not be made. 

8. [] A translation of the amendments to the claims under PCT Article 19 (35 U.S.C. 371(c)(3)). 

9. [x]An oath or declaration of the mventor(s) (35 U.S.C. 371(c)(4)). Unexecuted 

10. [] A translation of the annexes to the International Preliminary Examination Report under PCT Article 36 (35 

U.S.C. 371(c)(5)). 

Items 11. 1o 16. Below concern other document(s) or information included: 

1 1 .[x] An Information Disclosure Statement under 37 CER 1 .97 and 1 .98. 

12. [] An assignment document for recording. A separate cover sheet in compliance with 37 CFR 3.28 and 3.3 1 is 

included. 

13. [xJA FIRST preliminary amendment. 

[] A SECOND or SUBSEQUENT preliminary amendment. 

14. [] A substitute specification. 

15. [] A change of power of attorney and/or address letter. 

16. [x]Other items or information (specify)- PCT Publication Sheet, Int'l Preliminary Examination Report, Int'l Search 

Report, PCT Request, Notice Concerning Submission or Transmittal or Priority Document, Letter to WIPO 
forwarding Translation of Specification from Finnish language to English language, Notice Informing the 
Applicant of the Communication of the International Application to the Designated Offices, and Notification of 
the Recording of a Change 



Form PTO-I390 (REV 10-94) 



page 1 of 2 



,1, o o 



U.S. APPI.ICA 1 ION NO (If ku U «„, sec 37 CFRI5) 


INI TRNA riONAL \PP1JCA HON NO 

PCT/FTOO/00647 


AT lORNEY'S DOCK BT NUMBER 

4925-203PUS 


i * y / ^ ^ ^ — 

1 7.[x]The following fees are submitted: 


! 


Basic National Fee (37 CFR 1.492(a)(l)-(5)): 

Search Report has been prepared by the EPO 01 JPO $890.00 

International preliminary examination fee paid to USPTO (37 CFR 1.482) $710.00 

No international preliminary examination fee paid to USPTO (37 CFR 1.482) 

but international search fee paid to USPTO (37 CFR 1.445(a)(2)) $740.00 

Neither international preliminary examination fee (37 CFR 1 .482) 

nor international search fee (37 CFR 1 445(a)(2)) paid to USPTO $1040.00 

International preliminary examination fee paid to USPTO (37 CFR I 482) 

and all claims satisfied provisions of PCT Article 33(2)-(4) $100.00 




ENTER APPROPRIATE BASIC FEE AMOUNT = 


$ 


890 


Surcharge of $130.00 for furnishing the oath or declaration later than [J 20 [] 30 months 
from the earliest claimed priority date (37 CFR 1 .492(e)). 


$ 




Claims 


Number Filed 


Number Extra 


Rate 




Total Claims 


6-20 = 




x $18.00 


$ 




Independent Claims 


2 - 3 = 




x $84.00 


$ 




Multiple dependent claim(s) (if applicable) 


+ $280.00 


$ 




TOTAL OF ABOVE CALCULATIONS = 


$ 


890 


Reduction of Vi for filing by small entity, if applicable. 


$ 




SUBTOTAL = 


$ 


890 


Processing fee of $130.00 for furnishing the English translation later than [] 20 [] 30 
months from the earliest claimed priority date (37 CFR 1 .492(f)). + 


$ 




TOTAL NATIONAL FEE = 


$ 


890 


Fee for recording the enclosed assignment (37 CFR 1.21(h)). The assignment must be 
accompanied by the appropriate cover sheet (37 CFR 3.28, 3.31). $40.00 per property 


$ 




TOTAL FEES ENCLOSED 


$890 




Amount to be refunded: 


$ 




charged: 


$ 


a [x] One check in the amount of $ 890 to cover the above fee is enclosed. 

b [] Please charge my Deposit Account No. 03-2412 m the amount of $ to cover the above fees. A duplicate copy ol 

this sheet is enclosed. 

c. [x]The Commissioner is hereby authorized to charge any additional fees which may be required, or credit any 
overpayment to Deposit Account No. 03-2412. A duplicate copy of this sheet is enclosed. 

NOTE: Where an appropriate time limit under 37 CFR 1.494 or 1.495 has not been met, a petition to revive 
(37 CFR 1.137(a) or (b)) must be tiled and granted to restore the application to pending status. 



SEND ALL CORRESPONDENCE TO 
Michael C. Stuart 

Cohen, Pontani, Liebcrman & Pavane 

551 Fifth Avenue, Suite 1210 

New York, New York 1 01 76 



Michael C. Stuart 

Registration Number: 35.698 January 10. 2002 
Tel: (212) 687-2770 



Form PTO-1390 (REV 10-94) 



Attorney Docket # 4925-203PUS 

10/630667 

IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 

In re National Phase PCT Application of 

Ari LAKANIEMI 

International Appln. No.: PCT/FIOO/00647 

International Filing Date: July 14, 2000 

For: Method For Decreasing The Processing Capacity; 
Required By Speech Encoding And A Network 
Element i 

SECOND PRELIMINARY AMENDMENT 

Assistant Commissioner for Patents 
Washington, D.C. 20231 
BOX PCT 

S I R: 

Prior to examination of the above-identified application please amend the 
application as follows: 

In the Specification : 

Page 1, before line 4, the title (" FIELD OF THE INVENTION ") added by the 
Preliminary Amendment filed with the application, insert the following title and paragraph: 
- PRIORITY CLAIM 

This is a U.S. national stage of PCT application No. PCT/FIOO/00647, filed on July 14, 
2000. Priority is claimed on that application and on Application No. 991605, filed in Finland on 
July 14, 1999.- 
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By Express Mail # EL793472640US June 5, 2002 

REMARKS 

This preliminary amendment is presented to perfect the claim for foreign priority. 



No new matter has been added. Early examination and favorable consideration of the above - 
identified application is earnestly solicited. 

Any additional fees or charges required at this time in connection with the 
application may be charged to our Patent and Trademark Office Deposit Account No. 03-2412. 



Respectfully submitted, 

COHEN, PONTANI, L1EBERMAN & PA VANE 



Michael C. Stuart 

Reg. No. 35,698 

551 Fifth Avenue, Suite 1210 

New York, N.Y. 10176 

(212) 687-2770 




5 June 2002 
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Attorney Docket # 4925-203PUS 
IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 

In re National Phase PCT Application of 

Ari LAKANIEMI 

International Appln. No.: PCT/FI00/00647 

International Filing Date: 14 July 2000 

For: Method For Decreasing The Processing Capacity; 
Required By Speech Encoding And A Network' 
Element 

PRELIMINARY AMENDMENT 

Assistant Commissioner for Patents 
Washington, D.C. 20231 
BOX PCT 

S I R: 

Prior to examination of the above-identified application please amend the 
application as follows: 
IN THE SPECIFICATION: 

Page 1, before line 4, the paragraph beginning with "In general,", insert the following 

title: 

- FIELD OF THE INVENTION 
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Page 1, before line 8, the paragraph beginning with "In the arrangement", insert the 
following title: 

BACKGROUND OF THE INVENTION — 

Page 4, before line 5, the paragraph beginning with "It is an objective", insert the 
following title: 

SUMMARY OF THE INVENTION 

Page 5, before line 17, the paragraph beginning with "In the following", insert the 
following paragraph and title: 

-Other objects and features of the present invention will become apparent from the following 
detailed description considered in conjunction with the accompanying drawings. It is to be 
understood, however, that the drawings are intended solely for purposes of illustration and not as 
a definition of the limits of the invention, for which reference should be made to the appended 
claims. 

BRIEF DESCRIPTION OF THE DRAWINGS -. 

Page 6, before line 1, the paragraph beginning with "In the figures,", insert the following 

title: 

DETAILED DESCRIPTION OF THE PRESENTLY PREFERRED EMBODIMENTS 

-2- 
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Page 1 1 , after the last line, insert the following paragraph: 
—Thus, while there have been shown and described and pointed out fundamental novel features of 
the present invention as applied to a preferred embodiment thereof, it will be understood that 
various omissions and substitutions and changes in the form and details of the devices described 
and illustrated, and in their operation, and of the methods described may be made by those skilled 
in the art without departing from the spirit of the present invention. For example, it is expressly 
intended that all combinations of those elements and/or method steps which perform substantially 
the same function in substantially the same way to achieve the same results are within the scope of 
the invention. Substitutions of elements from one described embodiment to another are also fully 
intended and contemplated. It is the intention, therefore, to be limited only as indicated by the 
scope of the claims appended hereto.-. 

Page 12, line 1, delete "Claims" and insert therefor —What is claimed is:—. 

IN THE CLAIMS : 

Amend claim 1 to read as follows: 

1. A method for matching two different encoding methods in a telecommunication 
system using a discontinuous transmission method between the transmitter and receiver, 
characterized in that the signal path the signals transmitted by the transmitter are made suitable for 
the receiver so that 

-3- 
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- for a data frame, at least one information parameter containing at least two content 
identifiers is formed of the data parameters received (101), 

- data corresponding to the original data is synthesized from the original data parameters 
(101) of the received frames, 

- the synthesized data is transmitted for recording with an encoding method suitable for the 
receiver, 

- during recording, the data parameters (107) of at least some frames are updated on the 
basis of at least one value of said content identifiers of the information parameter, and 

- on the basis of the value of at least one other content identifier of the information 
parameter, the frames to be transmitted to the receiver are selected from all recorded data frames. 
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REMARKS 

This preliminary amendment is presented to place the application in proper form 
for examination. No new matter has been added. Early examination and favorable consideration 
of the above-identified application is earnestly solicited. 

Attached hereto is a mark-up version of the changes made to the specification and 
claims by the current amendment. The attached page is captioned Version with markings to 
show changes made " . 

Any additional fees or charges required at this time in connection with the 
application may be charged to our Patent and Trademark Office Deposit Account No. 03-2412. 



Respectfully submitted, 

COHEN, PONTANI, LIEBERMAN & PA VANE 




Michael C. Stuart 

Reg. No. 35,698 

551 Fifth Avenue, Suite 1210 

New York, N.Y. 10176 

(212) 687-2770 



10 January 2002 
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AMENDMENTS TO THE SPECIFICATION AND CLAIMS SHOWING CHANGES 

The claims have been amended as follows: 

1. A method for matching two different encoding methods in a telecommunication 
system using a discontinuous transmission method between the transmitter and receiver, 
characterized in that the signal path the signals transmitted by the transmitter are made suitable for 
the receiver so that 

- for a data frame, at least one information parameter containing at least two content 
identifiers is formed of the data parameters received (101), 

- data corresponding to the original data is synthesized from the original data parameters 
(101) of the received frames, 

- the synthesized data is transmitted for recording with [a] an encoding method suitable 
for the receiver, 

- during recording, the [at least some] data parameters (107) of at least some [the] 
frames are updated on the basis of at least one value of said content identifiers of the information 
parameter! and 

- on the basis of the value of at least one other content identifier of the information 
parameter, the frames to be transmitted to the receiver are selected from all recorded data frames. 
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METHOD FOR DECREASING THE PROCESSING CAPACITY REQUIRED BY SPEECH ENCODING AND A 
NETWORK ELEMENT 



In general, this invention relates to speech encoding and decoding used in digital 
5 radio systems and particularly a method by which the processing capacity required 
can be reduced in a telecommunication system using discontinuous transmission 
between a transmitter and a receiver. 

In the arrangement used in modern speech encoding techniques, speech codecs 
process the speech signal in periods, which are called speech frames or just frames. 

10 Here the term codec means the arrangement by which speech can be encoded. 
Preferably it comprises an encoding algorithm and means for implementing it on a 
speech signal. A typical frame length of a speech codec is 20 ms, which corresponds 
to 160 samples at a sampling frequency of 8 kHz. The speech frames generally vary 
from 10 ms to 30 ms. Each speech frame is processed in a speech encoder, and 

15 certain encoding parameters are formed of these frames and transmitted to the 
decoder. The decoder forms a synthesized speech signal by means of those 
parameters. 

In digital cellular radiotelephony systems, such as the GSM (Global System for 
Mobile communications), a discontinuous transmission method (DTX, 

20 Discontinuous Transmission), which is also defined in many speech encoding 
standards, is generally used. The discontinuous transmission method generally 
means that the transmitter part of the terminal is switched off* for most of the time 
when the user does not speak, i.e., when the terminal has nothing to transmit. The 
purpose of this is to reduce the average power consumption of the terminal and to 

25 improve the utilization of radio frequencies, because transnutting a signal, which 
carries just silence, causes unnecessary interference with other simultaneous radio 
connections. According to some research, only 40% of the data transmitted contains 
actual speech data. The rest is silence or background noise. Thus a discontinuous 
transmission method, in which frames that do not contain actual speech are 

30 removed, provides many advantages. Firstly, the processing load of the encoder can 
be reduced, because the "redundant" frames are not encoded at all. Secondly, when 
the number of frames to be transmitted is reduced, the power consumption of the 
device is also reduced. Furthermore, the loading of the network can be reduced, 
when "redundant" frames are removed from the data to be transmitted. 
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An operation called Voice Activity Detection (VAD) is used for speech detection in 
a discontinuous transmission method. The voice activity detection takes place e.g. 
so that a voice activity detector is arranged to examine each frame to be transmitted, 
and on the basis of the examination it is concluded whether the frame contains 
5 speech data or not. The operation of the voice activity detector is based on its 
internal variables, and the output of the detector is preferably one bit, which is 
called the VAD flag. Value 1 of the VAD flag then corresponds to a situation where 
there is speech to be processed, and value 0 a situation where the user is silent. Thus 
when the flag is up, the frame contains speech data and it can be transmitted. 
10 Correspondingly, when the VAD flag is down, the frame can be entirely removed. 

The discontinuous transmission method has one disadvantage. When the trans- 
mission is interrupted, the background noise that exists in the frames that contain 
speech, also disappears. This may cause a very unpleasant effect at the receiving 
end. In a discontinuous transmission method, the interruption of the transmission 

15 may take place quickly and at irregular intervals, whereby the receiver experiences 
the quickly changing voice level as disturbing. Especially when the level of the 
background noise is high, the interruption of the transmission may even make it 
more difficult to understand the speech. Therefore it is advantageous to produce in 
the receiver some synthetic noise, which resembles the background noise of the 

20 transmitter and which is called Comfort Noise (CN), even when no frames are 
transmitted to the receiving end. 

The production of comfort noise takes place e.g. so that at first the level of the 
actual background noise is estimated by means of some frames that contain 
background noise when the value of the VAD flag changes from one to zero. The 

25 element that decides about the discontinuous transmission mode transmits these few 
frames to the receiver as speech frames. This period when the speech burst has 
ended, but the transmission of speech frames has not yet been switched off, is called 
a hangover period. The frames that are transmitted during the hangover period, only 
contain data caused by background noise, whereby the parameters of the comfort 

30 noise can be safely determined by means of these frames. A Silence Descriptor 
(SID) frame is advantageously used for transmitting the comfort noise parameters to 
the receiver. The values of the parameters of the SID frames are updated regularly, 
and at least when the level of the background noise changes. In practice, the SID 
frame can be used in at least the following two ways. Firstly, a SID frame is 

35 transmitted immediately after the hangover period. After this, SID frames are 
transmitted regularly. An arrangement like this is used in the speech codecs of the 
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GSM system, for example. Another possibility is to transmit a SID frame 
immediately after the hangover period, but to transmit the next SID frame only 
when the encoder detects a change in the characteristics of the background noise. 

In an ideal situation, both the transmitting terminal and the receiving terminal use 
5 the same speech encoding method. In a case like this, the encoded speech need not 
be changed suitable for some other encoding method. However, in practice this is 
often necessary. In a situation like this, the encoded speech data is encoded 
differently by means of a transcoder. The transcoder can be located at any point of 
the signal path between the transmitter and the receiver. 

10 The prior art transcoders are typically implemented in a manner shown in Fig. 1. 
The input of the transcoder consists of the input parameters 101 transmitted by the 
transmitter. The discontinuous transmission reception block 102 of the transcoder 
has been arranged to estimate whether the parameters received contain speech or 
comfort noise. Information about the contents of the frame is transmitted to the 

15 speech encoder 104 by means of the SP (Speech Present) flag 103, for example. In 
addition, the frame is also transmitted to the speech decoder 104. The decoding 
method of the frame depends on the value of the SP flag 103. After decoding, the 
synthesized speech or comfort noise is transferred to the internal buffer circuit 105 
of the transcoder. The recoding of the contents of the buffer circuit 105 is started 

20 when the buffer circuit 105 contains a sufficient amount of data. When data is 
recoded, the voice activity detector 106 is used at first to examine whether the frame 
contains speech or background noise. On the basis of the quality of the data 
contained by the frame, the voice activity detector 106 forms a VAD flag 107 and 
gives it a value. In addition, it transmits the value of the VAD flag 107 and the 

25 frame that arrived to it as such forward to the speech encoder 108. The value of the 
VAD flag 107 is also given to the transmitter unit 110 of the transcoder. The speech 
encoder 108 processes the data coming to it and transmits the parameters 109 of the 
encoded data to the transmitter unit 110. The transmitter unit 110 checks on the 
basis of the values of the VAD flags 107 it received which frames are to be 

30 transmitted to the network and which not. In order to make the receiver block of the 
tenninal receiving the signal also to maintain the generation of comfort noise, some 
frames containing comfort noise can also be transmitted to the receiver, and the 
parameters of these frames containing comfort noise have been updated in the 
speech encoder 108, when required. 

35 The problem in the prior art solutions is the fact that the voice activity detector ii 
used twice. For the first time it is used in the encoder circuit of the transmitting 
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terminal and then again in the transcoder. In practice, this means that unnecessary 
computation procedures are carried out when speech data is transmitted, because in 
prior art solutions the same voice activity detection procedure is performed twice on 
the same data flow. 

5 It is an objective of this invention to eliminate the above mentioned problem of the 
prior art. 

The objectives of the invention are achieved by implementing a transcoder 
arrangement, by means of which the quality of the contents of the frame can be 
checked in a simple manner, whereby excessive use of processing capacity is 
10 avoided. 

The method according to the invention for matching two different encoding methods, 
in a telecommunication system using a discontinuous transmission method between 
the transmitter and receiver is characterized in that in the signal path the signals 
transmitted by the transmitter are made suitable for the receiver so that 

15 - for a data frame, at least one information parameter containing at least two content 
identifiers is formed of the data parameters received, 

- data corresponding to the original data is synthesized from the data parameters of 
the received frames, 

- the synthesized data is transmitted for recoding with an encoding method suitable 
20 for the receiver, 

- during recoding, at least some data parameters of the frames are updated on the 
basis of at least one value of the content identifiers and 

- on the basis of the value of at least one other content identifier, the frames to be 
transmitted to the receiver are selected from all recoded data frames. 

25 The network element according to the invention, which is arranged to match two 
different encoding methods in a telecommunication system using a discontinuous 
transmission method between the transmitter and receiver is characterized in that in 
the signal path the signals transmitted by the transmitter are arranged to be made 
suitable for the receiver by a network element, which comprises 

30 - means by which at least one information parameter containing at least two content 
identifiers is formed for a data frame of the data parameters received, 
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- means by which synthesized data corresponding to the original contents of the data 
is formed of the data parameters of the received frames, 

- means for recoding the synthesized data with an encoding method suitable for tht 
receiver, 

5 - means for updating the data parameters of at least some frames on the basis of at 
least one value of the content identifiers and 

- means for selecting the frames to be transmitted to the receiver on the basis of at 
least one other value of the content identifiers from all the recoded data frames. 

Preferred embodiments of the invention are described in the dependent claims. 

10 According to the invention, the procedure for carrying out voice activity detection is 
removed from the signal path, preferably from the transcoder. By an arrangement 
like this, the structure of the transcoder can be simplified and processing capacity 
can be saved for other purposes. Information about the contents of the frames is 
preferably transmitted by means of at least one information parameter, which 

15 comprises at least two different content identifiers, to the element which makes the 
decision about the frames to be transmitted forward. 

In the following, the invention will be described in more detail with reference to the 
accompanying drawings, in which 

Figure 1 is a block diagram of a prior art transcoder, 

20 Figure 2 shows a transcoder according to one embodiment of the invention, 

Figures 3a and 3b show some possibilities of using the flag bits of a transcoder 
according to the invention to indicate the contents of the frames, 

Figure 4 shows a first network arrangement, in which a transcoder according to 
the invention is applied, 

25 Figure 5 shows another network arrangement, in which a transcoder according to 
the invention is applied, and 

Figure 6 shows a third network arrangement, in which a transcoder according to 
the invention is applied. 
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In the figures, the same reference numbers and markings are used for corresponding 
parts. Figure 1 was discussed above in connection with the description of the prior 
art. 

Figure 2 shows a preferred embodiment of a transcoder according to the invention. 
5 The transcoder receives as its input the parameters 101 formed of the speech signal 
at the transmitting end. The reception block 102 of the transcoder processes the 
received data and forms an SP flag 103 thereof. The SP flag 103 indicates whether 
the received frame contains speech data or comfort noise. Here speech data is thus 
either an actual speech signal or background noise. For example, when the value of 

10 the SP flag 103 is 1, the frame contains speech data or background noise, and when 
the value of the SP flag 103 is 0, the frame contains comfort noise. A frame 
containing comfort noise is called a SID frame here according to the above 
description. In addition to the SP flag 103, the reception block 102 determines the 
HO flag 201 from the received frames. The HO flag 201 can be given the value 1, if 

15 the frame is the first one after the hangover period, otherwise the value is 0. It is 
clear to a person skilled in the art that the HO flag indicates that background noise 
has been transmitted in the transmission during the hangover period, by means of 
which background noise the parameters contained by the SID frames can be 
updated. The SP flag 103 and the HO flag 201 are preferably transmitted to the 

20 buffer circuit 105. The value of the SP flag 103 of a certain frame is also transmitted 
to the decoder 104 together with the data parameters contained by the frame. The 
decoder 104 is arranged to decode the data parameters of the frame that arrived to it 
into synthesized speech data and to transmit the synthesized speech frame or 
comfort noise frame to the internal buffer circuit 105. The decoding method used by 

25 the decoder 104 is preferably dependent on the value of the SP flag 103. The speech 
encoder 108 after the buffer circuit 105 is arranged to read the HO flag 201, SP flag 
103 and the synthesized data frame related to them, which are in the buffer circuit 
105. The speech encoder 108 starts the recoding of the data e.g. in a corresponding 
manner as in the prior art solutions, i.e. when adequate data has been fed to the 

30 buffer circuit 105. The speech encoder 108 can also update the data parameters of 
the comfort noise contained by the SID frames. The speech encoder 108 transmits 
the parameters 107 formed of the data and the SP flag 103 to the transmitter unit 
1 10. The transmitter unit 110 checks the value of the SP flag 103 of each frame and 
transmits forward at least the parameters of the frames which contain speech data. 

35 Preferably, in addition to these frames, some frames which contain comfort noise 
parameters are transmitted to the receiver so that the receiver can use them to 
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minimize unpleasant reception effects. It is clear to a person skilled in the art that 
the decoder 104 and the encoder 108 can be arranged to use different codecs. 

It has been described above that the two flags, the SP flag 103 and the HO flag 201 
are separate content identifiers, which can be used to indicate the type of data 
5 contained by each frame, for example. It is clear to a person skilled in the art that 
the information contained by the content identifiers can also be gathered under one 
parameter. A parameter like this may be called an information parameter, for 
example, and it may be a hexadecimal number or the like. In the information 
parameter arrangement, the first bit of the value of the parameter, for example, 

10 indicates the value of the SP flag 103 and the second bit the value of the HO flag 
201, and the values of these bits can be changed independently of each other. The 
information parameter can thus have one value, and the values of different content 
identifiers can be found out by examining different parts of the value. It is also clear 
to a person skilled in the art that values of other corresponding flags can also be 

15 included in the information parameter when required, which values may be needed 
for other purposes in speech encoding, for example. The information parameter can 
belong to any number system or the like, which is suitable for the above mentioned 
purpose. 

Fig. 3 a shows in the form of a timing diagram the modes of the content identifiers 

20 used in the invention, i.e. the SP flag 103 and the HO flag 201, depending on the 
contents of the frame. In the exemplary embodiment shown here, the first three 
frames contain speech data, whereby the value of the SP flag 103 is 1. In this 
embodiment, these frames are followed by a hangover period, which lasts for four 
frames altogether, and also then the value of the SP flag 103 is 1. During the 

25 hangover period, the transmission has not yet been interrupted, although the speech 
burst has ended. Background noise is advantageously transmitted in the frames, by 
means of which possible new parameters can be defined for the comfort noise 
formed of the background noise. It is clear to a person skilled in the art that the HG 
flag 201 can be advantageously used to define for the speech encoder 108 when 

30 there is a hangover period after the frames that contain actual speech data. The 
frames that belong to this hangover period contain background noise, and on the 
basis of the information contained by these frames, the comfort noise parameters of 
the SID frames can be updated. During the transmission of the SID frames, the 
values of the SP flag 103 and the HO flag 201 are zero. It is clear to a person skilled 

35 in the art that when frames that contain some data, such as speech or background 
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noise, come to the signal to be transmitted, the flags rise to the correct values 
according to the description above. 

Fig. 3b shows a tirning diagram of another arrangement according to the invention, 
in which the modes of the SP flag 103 and the HO flag 201 are arranged to be 
5 settled differently than in the case of Fig. 3a. In this exemplary case, the first three 
frames contain speech data, whereby the value of the SP flag 103 is 1. In this 
embodiment, these frames are followed by a hangover period, which lasts for four 
frames altogether, and also then the value of the SP flag 103 is 1. During the 
hangover period, the transmission has not yet been interrupted, although the speech 

10 burst has ended. Background noise is advantageously transmitted in the frames, by 
means of which possible new parameters can be defined for the comfort noise 
formed of the background noise. In this exemplary embodiment, the HO flag 201 is 
arranged to rise when the first frame of the hangover period has its turn of 
transmission. The identification of the first frame of the hangover period can be 

15 arranged in the receiver block 102, for example. In this exemplary embodiment the 
HO flag 201 is also arranged to be kept up until the first SID frame after the 
hangover period. It is clear to a person skilled in the art that the modes of the flags 
mentioned above can be arranged such that they are best suited for each application 
in which the flags are used. 

20 The arrangement discussed above provides clear advantages as compared to the 
prior art solutions. Generally it is obvious that the algorithms used for voice activity 
detection are often very complicated and thus very heavy to perform. By skipping 
one extra voice activity detection, signal processing as a whole can be simplified 
and processing capacity can be saved for other operations. The arrangement 

25 according to the invention is particularly advantageous in a situation where more 
than one transcoders have been integrated in one apparatus. In that case, the total 
saving of processing capacity may be substantial. According to some tests, in the 
case of a Full Rate (FR) codec used in the GSM system, for example, the reduction 
of one determination of voice activity detection has substantially reduced the 

30 complexity of processing. 

Another advantage provided by the arrangement according to the invention is also 
related to simpler implementation. Namely, although the voice activity detection is 
the same with each codec, there may be differences in the way that the voice 
activity detector is implemented. In prior art arrangements it is possible that the 
35 comfort noise produced by a certain codec can be interpreted as speech in the voice 
activity detector of another codec, in which case the system is unnecessarily loaded. 
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Especially it has to be noted that the codecs often encode frames that are classified 
as noise or the like in a simpler manner than frames that are classified as speech. 
Thus if a frame that contains noise is classified as speech, a larger amount of 
processing capacity is used for mis frame, and the process becomes heavier. By 
5 leaving the voice activity detection out from the transcoder, problems like this, 
which result in the use of unnecessarily high processing power, can be avoided. 

In the above description of the invention it has been assumed that the frame times in 
different codecs are the same. The arrangement according to the invention can 
advantageously also be used in a case where the frame times between different 

10 codecs are different. Let us assume, by way of example, that codec A with a frame 
time of 20 ms, for example, has been used for the data corning to the transcoder. 
The system to which the data is to be transmitted, uses codec B with a frame time of 
30 ms, for example. In an arrangement according to the invention, in a case like this 
the matching of the frame times can be implemented by, for example, arranging the 

15 SP and HO flags at intervals of 10 ms in the data in the buffer circuit 105. Thus, 
when the data of codec A is changed into data of codec B, the decoder writes two 
SP and HO flags in the buffer circuit 105 for each frame. Correspondingly, when 
the speech encoder reads data from the buffer circuit 105, it preferably reads three 
SP and HO flags per frame, or 30 ms altogether. On the basis of these three pairs of 

20 flags, the transcoder classifies the new frame either as speech or noise and gives the 
SP flag a value based on the classification. At the simplest, the classification may be 
based on the criterion that if at least two of the SP flags are up, the value of the new 
SP flag is also 1. It is clear to a person skilled in the art that other possible solutions, 
such as different combinations of the SP and HO flags can also be used in the 

25 classification. If the transcoder operates in the other direction, it is clear that the 
decoder writes three pairs of flags in the buffer circuit, of which the speech encoder 
preferably reads two pairs of flags per frame. It is clear to a person skilled in the art 
that the flags can also be arranged in the data flow with different intervals than 
those mentioned above. Preferably the interval is such that the intervals of the 

30 frames of codec A and codec B are both divisible by the interval. 

It is clear to a person skilled in the art that the hangover period, which has an effect 
on the value of the HO flag, is dependent on the codec. For example, the hangover 
period of an FR codec of the GSM system is four frames of 20 ms, whereas in the 
codec presented in the standard ITU-T G. 723.1, for example, the hangover period is 
35 six frames of 30 ms. With the method according to the invention, possible problems 
caused by the lengths of different hangover periods can be avoided. For example, if 
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the hangover period of codec A is temporally longer than the hangover period 
produced by codec B, there are no problems, because the speech encoder can 
remove the extra portion of the hangover period when required. On the other hand, 
if the hangover period of codec A is temporally shorter than the hangover period of 
5 codec B, the hangover period can be increased in the speech encoder, when 
required. This can be implemented e.g. by using the same frames containing comfort 
noise to new frames during the hangover period. 

In the next passage, the application of an arrangement according to the invention in 
a mobile communication network, such as the GSM network, will be discussed. The 
10 transcoder is preferably located between the terminals as connected to a network 
element. In the GSM network, for example, there has been arranged a separate 
network element called TRAU (Transcoder/Rate Adaptor Unit). Generally speaking, 
the task of the TRAU unit is to match networks using different signals. This means, 
for example, that the signal transfer rates are adapted for the systems. In addition, 

15 speech is recoded in the TRAU to make it suitable for transmission to a network 
using another speech encoding system. Figure 4 shows the location of a TRAU 305 
according to a preferred embodiment of the invention in a mobile communication 
network. This TRAU 305 comprises means 308 for processing the received speech 
parameters so that an SP flag can be determined from the parameters to indicate 

20 whether the received frame contains speech parameters or comfort noise parameters. 
In addition, TRAU 305 comprises means 308, by means of which the HO flag can 
be determined from the received parameters to indicate the first frame after the 
hangover period. Furthermore, TRAU 305 comprises means 309 for decoding the 
speech with a codec agreed on in advance, for example. TRAU 305 also comprises 

25 means 310, to which the synthesized speech data and the SP and HO flag can be 
temporarily moved. In addition, TRAU 305 comprises means 311, by which said 
information can be read from the buffer circuit and according to the information be 
recoded by some other codec, and by which means 3 1 1 the parameters of frames 
containing comfort noise can be updated, when required. Furthermore, TRAU 305 

30 comprises means 312, to which the parameters of the encoded data and the SP flag 
can be moved and in which means 3 12 the frames to be transmitted forward can be 
selected on the basis of the value of the SP flag, for example. According to a 
preferred embodiment, TRAU 305 transmits forward only the frames that contain 
speech data. It is clear to a person skilled in the art that the means presented can be 

35 understood as a microprocessor circuit or the like, which implements the operations 
presented above by means of inputted programs, for example. Preferably the 
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microprocessor is provided with memory, in which the speech data and the values 
of the flags, for example, can be temporarily saved. 

The TRAU 305 shown in Fig. 4 is located in connection with a Base Transceiver 
Station (BTS) 304 of the mobile communication network. Fig. 4 also shows a Base 
5 Station Controller (BSC) and a Mobile Switching Centre (MSC) of the mobile 
communication network. It is clear to a person skilled in the art that the network 
elements are separate operational units, as shown by lines 301, 302 and 303 in Fig. 
4. Fig. 5 shows corresponding network elements. In this exemplary embodiment, 
TRAU 305 is located in the immediate vicinity of the base station controller 306. 

10 Fig. 6 shows a third possibility of locating TRAU 305 in connection with the mobile 
switching centre 307 as a separate operational unit. It is clear to a person skilled in 
the art that TRAU 305 can also be located in other possible network elements. 
Network elements of the GSM system have been used as examples in this 
description when discussing how a transcoder according to the invention can be 

15 placed in the network topology. It is clear that a transcoder according to the 
invention can also be placed in other network elements than TRAU 305 and also in 
other systems than the GSM to perform corresponding operations as those presented 
here. 

It is clear to a person skilled in the art that the terms used above have been used as 
20 examples, and their sole purpose is to clarify the application of a method according 
to the invention. The arrangement according to the invention can also be used in 
other systems than the GSM. Particularly advantageously the method presented 
above is applied in any system which encodes and decodes speech, within the scope 
defined by the attached claims. 
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Claims 

1. A method for matching two different encoding methods in a tele- 
communication system using a discontinuous transmission method between the 
transmitter and receiver, characterized in that in the signal path the signals 

5 transmitted by the transmitter are made suitable for the receiver so that 

- for a data frame, at least one information parameter containing at least two content 
identifiers is formed of the data parameters received (101), 

- data corresponding to the original data is synthesized from the data parameters 
(101) of the received frames, 

10 - the synthesized data is transmitted for recoding with a encoding method suitable 
for the receiver, 

- during recoding, at least some data parameters (107) of the frames are updated on 
the basis of at least one value of said content identifiers of the information 
parameter and 

15 - on the basis of the value of at least one other content identifier of the information 
parameter, the frames to be transmitted to the receiver are selected from all recoded 
data frames. 

2. A method according to Claim 1, characterized in that the data parameters 
(107) of the frames to be updated are data parameters that describe background 

20 noise. 

3. A method according to Claim 1, characterized in that the value of at least one 
of said content identifiers of the information parameter comprises information abou* 
the first frame after a hangover period. 

4. A method according to Claim 1, characterized in that the value of at least one 
25 other of said content identifiers of the information parameter comprises information 

about the contents of the frame. 

5. A network element, which is arranged to match two different encoding 
methods in a telecommunication system using a discontinuous transmission method 
between the transmitter and receiver, characterized in that in the signal path the 

30 signals transmitted by the transmitter are arranged to be made suitable for the 
receiver by a network element, which comprises 
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- means (308) by which at least one information parameter containing at least two 
content identifiers is formed for a data frame of the data parameters received (101), 

-means (309) by which synthesized data corresponding to the original contents of 
the data is formed of the data parameters (101) of the received frames, 

5 - means (3 1 1) for recoding the synthesized data with an encoding method suitable 
for the receiver, 

- means (311) for updating the data parameters of at least some frames on the basis 
of at least one value of the content identifiers of said information parameter and 

- means (312) for selecting the frames to be transmitted to the receiver on the basis 
10 of at least one other value of the content identifiers of the information parameter 

from all the recoded data frames. 



6. A network element according to Claim 5, characterized in that it is a 
Transcoder/Rate Adaptor Unit (TRAU) (305). 
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